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ABSTRACT 

Taking  Albert's  (1976)  formulation  of  a  mixed  model  we 

consider  Improved  estimation  of  the  variance  components  for 
balanced  designs  under  squared  error  loss.  Two  approaches  are 
presented.  One  extends  the  ideas  of  Stein  (1964).  The  other  is 
developed  from  the  fact  that  variance  components  can  be  expressed 
as  linear  combinations  of  chi-square  scale  parameters. 

Encouraging  simulation  results  are  presented.  '  ' 


1 .  INTRODUCTION  ' 

Albert  (1976)  exhibits  necessary  and  sufficient  conditions  for 
a  sum  of  squares  decomposition  under  a  mixed  model  to  be  an  ANOVA, 
i.e.,  for  the  terms  of  tie  decomposition  to  be  independent  and  to 
be  distributed  as  raultipl’s  of  chi-square.  We  consider  improved 
estimation  of  the  variance  components  under  squared  error  loss  in 
such  a  set-up.  We  make  no  attempt  to  discuss  the  enormous 
literature  on  this  problem.  See  Harville  (1977)  for  such  a 
review.  Rather,  we  specialize  to  the  "balanced"  case  considering 
designs  consisting  of  crossed  and  nested  classifications  and 
combinations  thereof.  Rules  of  thumb  for  formalizing  the 
associated  ANOVA  table  are  thus  well  known  (see,  e.g.,  Searle 
(1971,  Chap.  9)).  Customary  estimators  of  the  variance  components 
are  the  unbiased  ones  obtained  as  described  in  Searle,  pp.  405-6. 
Under  normality  these  estimators  are  UMVU  (Grayblll  (1954), 
Graybill  and  Wortham  (1956))  and,  in  fact,  restricted  maximum 
likelihood  (REML)  (Thompson  (1962)).  However,  positive  part 
corrections  are  usually  taken  yielding  Improved  mean  square  error 
but  sacrificing  these  "optimalities.”  Bayesian  approaches  to 
variance  component  estimation  in  this  setting  are  discussed  in, 
e.g..  Hill  (1965)  and  Box  and  Tiao  (1973). 

Since  the  positive  part  estimators  are  not  smooth  and,  thus, 
not  admissible  under  squared  error  loss  (SEL),  it  is  natural  to 
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seek,  dominating  estimators.  The  earliest  work  of  this  type  is  due 
to  Klotz,  Milton  and  Zacks  (1969)  for  the  one-way  layout.  They 
show,  for  example,  that  the  MLE  of  the  "between"  variance 
component  (see  Herbach  (1959)),  which  is  itself  a  positive  part 
estimator,  dominates  the  UMVU  and,  using  ideas  of  Stein  (1964), 
that  it  in  turn  can  be  dominated. 

The  objective  of  this  paper  is  to  describe  two  general 
approaches  for  creating  improved  estimators  of  the  variance 
components  under  SEL.  In  Section  2  we  develop  a  method  by 
extending  the  aformentioned  Stein  idea.  We  have  discussed  a 
special  case  of  this  approach  in  linear  regression  models  in 
Gelfand  and  Dey  (1987b).  Since  the  variance  components  are  linear 
combinations  of  chi-square  scale  parameters,  we  can  draw  upon  some 
literature  for  improved  estimation  of  linear  combinations  of  scale 
parameters.  This  second  approach  is  offered  in  Section  3.  Work 
of  Dey  and  Gelfand  (1987)  for  arbitrary  scale  parameter 
distributions  and  of  Kloneckl  and  Zontek  (1985,  1987)  for  the 
Gamma  family  of  distributions  is  pertinent  here.  Finally,  in 
Section  4  we  present  some  simulation  results. 

In  the  remainder  of  this  section  we  develop  notation  for  and 
features  of  the  model  we  will  be  working  with.  Consider  the 
general  balanced  mixed  model  of  the  form 

P 

Y  =  tl  +  f  H  ’  +  +  c  (1.1) 

nxl  r  r 

r=l 

where  Y  is  an  n  x  1  vector  of  observations,  u  is  an  overall  mean 

effect  or  Intercept,  H  are  known  n  x  m  incidence  matrices  where 

jr  r 

HI  *1  and  H  H  »  v  I  (l.e.,  v  is  the  number  of  nonzero 
r  m  xl  nxl  r  r  r  r 

r 

entries  in  a  typical  column  of  H  ),  "t  are  Independent  distributed 
2  r  r 

as  N(0,o  I  ),  X  is  a  known  n  x  s  design  matrix  involving 
r  m 

r 
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possibly  fixed  effects  and  covariates,  ^  is  the  associated  s  x  1 

vector  of  coefficients  and  e  is  an  n  x  1  vector  of  errors 
2 

distributed  N(0,a  1  )  Independent  of  the  t  •  Thus,  Y  N(m,W) 
e  n  r 

where 


m  =  Ul  +  X6 
nxl 


and  W  is  the  patterned  covariance  matrix 


2  r  2  T 

W=cl  +  2rHH 
e  n  r  r  r 

r=l 


2  T  ,  2  2  2  T 

Let  (-  )  Our  primary  interest  Is  in 

el  p 


estimating  the  :  individually  (as  it  has  been  done 
historically)  although  we  shall  say  something  in  Section  3  about 
simultaneous  estimation. 

As  in  Albert  (1976)  we  consider  a  complete  set  of  orthogonal 


projections,  P,P,  ...,P,P,P,  P  ,  2p  +p  +p  +P-=I. 

12  p  e  L  ?  1  i  e  i-  -  n 

T 

In  particular,  P  is  associated  with  the  error,  i.e.,  Y  P  Y  is  the 
e  e 

full  model  error  sum  of  squares-  P^.  is  associated  with  the 

-1  T  -2  - 

Intercept  (Pp  =  '-  1  ),  i.e.,  Y  P,,Y  =  nY.  where  Y.  is  the 

nxn  ^ 

average  of  the  Y's.  SS^  is  the  model  sum  of  squares  for  the 

T  T  -  T  ^  T  ~2 

reduced  ANOVA  model,  i.e.,  SS  =  Y  H(H  H)  H  Y  =  L  y  P  Y  +  nY. 

H  i 

1=1 


where  H  »  (H  H  ...  H  ).  Note  that  we  have  a  sum  of  squares  for 

1  2  p 

each  random  effect.  Finally,  let  squares  for 

the  fixed  effects  and  covarlates  adjusted  for  the  ANOVA,  I.e., 
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T 

SS  I  “  Y  P„Y.  Typically,  Po  Is  Itself  expressed  as  a  sum  of 

p  I H  p  ^ 

orthogonal  pieces. 

According  to  Albert  (1976)  (see  also  Brown  (1984)  and 

Harvllle  (1984)  in  this  regard),  we  have  an  ANOVA  if  and  only  if 

T  T 

for  r  =  1,  2 . p,  H  H  P  =  X  P  ,  k.  =  1,  r,  H  H  P  = 

r  r  k  kr  r’  r  r  P 

T  T 

>  P  ,  H  H  P  =  X  p  and  H  H  P„  =  X  p  where  >  =  0  or  p 

ur  u  r  r  e  er  e  r  r  6  6r  6  -r  r 


T  T 

according  to  whether  or  not  H  P  =  0.  Then  the  Q  =  Y  P  Y  are 

r  -  -  - 

2 

Independent  and  distributed  as  ■  where 

2^2 

-  ■  +  ■  :  ,  f  =  rank(P  ),  =0,  *  =0,  = 

-e  -rr-  -  k  e 

r=l 

T  T 

m  P  m/2  •  ,  ■'  .  =  m  P  .  m/2  ' .. . 

We  note  that  since  for  each  r,  1  >:(H  ),  ’  =  and, 

nxl  r  -r  r 


thus , 


2,2  T  T  „  , 

+  j  ■  •  Since  HP  =0  and  HP  =  0  '  =0  and 

err  e  •  er 


^  2  2 
=  0,  l.e.,  ■  ='  and  •  =  ■'  .  The  i  can  not  be 

:r  e  e  "  e  kr 

determined  explicitly  vicliout  specifyinp  tlu-  desien. 

However,  for  two  random  effects,  with  respective  sums  of  squares 
T  T 

Y  P  Y  and  Y  P  Y,  if  the  latter  is  any  nested  or  crossed  effect 
k  k 

involving  all  the  factors  in  the  former,  then  >  >  <  .  This  is  , 

k  ~  k 

in  fact.  Rule  12  of  Searle  (1971  ,  p.  393).  Obviously,  X  <  >  < 

e  “  k  ~ 

and  typically  there  is  a  partial  ordering  amongst  the  x  , 

k 

Finally,  again  as  in  Searle  (1971,  p.  405),  if  we  define 

T  2  2 

I  -  {'I r.,  i,  ,  ...»  X  )  with  X  =0  we  have  X  =  AX^  where 
0  1  p  0  e 


A  =  r 


1  0 


with  {!’} 

kr 


>  whence 
kr 
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2  -1 

:!  =  A  y. 


(1.2) 


Expression  (1.2)  reveals  a  key  point.  The  variance 

components  are  expressible  as  linear  combinations  of  chi-square 

scale  parameters.  In  fact,  this  expression  is  usually  employed  to 

2  -1 

create  the  familiar  unbiased  estimators  of  the  o  using  f  Q  , 

r  k  k 

the  unbiased  estimator  of  >  . 

2.  IMPROVED  ESTIMATORS  USING  STEIN'S  METHOD 

Consider  estimating  aY  +  by  .  For  appropriate  choices  of 

k  k 

a,  b,  k,  k'  (in  fact  ab  <  0),  this  parameter  will  be  a  variance 
component.  To  proceed  we  utilize  the  following  elementary  lemma 
whose  proof  is  immediate. 

Lemma  2. 1.  Let  be  an  estimator  of  and  let  T^  dominate 

S  under  SEL.  Let  S,  be  an  estimator  of  '  where  is 
1  2  2  2 

independent  of  3,  and  T  .  Then  in  estimating  +  b'  ,  aT  +  bs 
11  1  2’  1  2 
dominates  aS  +  bS  under  SEL  if 
1  2 


ibE,  -  T^)  E,  (S^  -  v^)  2  0 
1  2 


(2.1) 


In  our  applications  we  will  meet  (2.1)  by  having  ab  <  0,  T^  ^  S^, 

<  V 

We  also  require  the  following  result  which  is  a  minor 
generalization  of  a  theorem  stated  and  proved  In  Gelfand  and  Dey 
(1987b). 

2  2 

Theorem  2.1.  Let  S  >  +  S  '  (y  +  :  )X 

-  0  On  1  ^0  ’in  ,■>  ’ 

0  i’  1 

1=1,  ...,  t  all  independent  where  0,  0.  Define  R.  = 

-1  j 

Cj  -  where  c^  =  L  +  2  and  let  =  min(R^,  R^ . R^). 


1=1 


1=1 


Then  in  estimating  under  SEL  >5  «  ^  . . .  «  ^  where 

0  0  12  t 

6  <<  6  means  6  dominates  6  . 

i  j  j  1 

Lastly  we  need  a  lemma  which  appears,  for  example,  in  Klotz, 

Milton  and  Zacks  (1969,  p.  1394). 

Lemma  2.2.  If  T  ^  S  and,  in  estimating  9  >  0,  S  «  T  under 

SEL  then  S  <<  T  ,  under  SEL  where  +  denotes  positive  part. 

These  results  will  be  synthesized  in  the  following  way. 

Assume  ay  +  hy  >  0  and  w.l.o.g.  that  a  >  0,  b  <  0.  This  will 
k  k 

be  the  case  if  ay  +  bY  defines  a  variance  component.  Find  the 
k  k 

set  of  all  Y  >  y  (excluding  y  ,  regardless).  This  set  is 
r  ~  k  k 

nonempty  since  at  the  very  least  Y,  >  y  .  Q  and  the  associated 

~  k  k 

set  of  Q  form  the  S^  and  S^,  respectively,  for  Theorem  2.1  and 

enable  the  creation  of  a  decreasing  sequence  of  estimators  which 

dominate  (f  +2)  Q  ,  the  best  invariant  estimator  of  ;  .  The 
k  k  k 

resultant  suitably  defined  play  the  role  of  S  and  T  in  Lemma 
J  ■  -ill 

2.1  and  will  be  Independent  of  ^  *^k' ’  "  ^ 

Lemma  2.1  holds.  Finally,  using  Lemma  2.2,  [aS^  +  bS^]  << 

[aT^  +  bS^]^. 

Remark  2.1.  Theorem  2.1  allows  for  a  variety  of  improyed 
2 

estimators  for  :  .  Let  S  =  Q  with  S  being  the  Q  , 
e  0  e  i  r 

r  =  1,  ...,  p  as  well  as  Q,  and  Q p.  Then  t  =  p  +  2  and  we  may 

readily  create  ;  .  In  fact,  corresponding  to  any  specified 

permutation,  of  the  there  will  be  a  resultant  ,  l.e., 

there  will  be  tl  such  estimators.  How  might  we  combine  them  to 

produce  a  permutation  Invariant  estimator?  It  can  be  argued  that 

the  minimum  of  these  will  be  "too  small"  and  that  the  average  is  a 

better  practical  choice.  See  Gelfand  and  Dey  (1987b)  for  details. 

We  Illustrate  using  the  one-way  ANOVA,  Y  =  ii  +  a  +  r 

ij  i  i  j 


i  =  1. 


1  1«  •••!  d. 


\  N(0,  a^),  e  -o  N(0,  c^),  all 

i  ^  ij  e 


2  2  2 

Independent.  In  this  case  p  =  1  with  Q  (a  +  J--,,  )x  , 

I  e  “  I-l 
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-  (o;  +  jo;)x5 


2  2 

2 (o  +  Jo  ) 
e  a 


and  Q  p  X-, ,  n 
e  e  T (J- U 


In  estimating  o  we  may  dominate  the  best  invariant 

e  -1 

estimator  =  (l(J-l)  +  2)  using  ~  mlnCR^,  R^)  which  in 

turn  is  dominated  by 


=  mlnCR^,  R^,  R^) 


(2.2) 


or  using  6'  =  min(R  ,  R  ’)  which  in  turn  is  dominated  by 

e ,  1  0  1 


■  '  =  min(R  ,  R  ' ,  R  ) 

e,2  0  1  ’  2 


(2.3) 


Here  R,  =  (IJ  +  1)  +  Q),  R,  '  =  (I(J  -  i;  +  3)  ’‘(Q  +  q  )  and 

i  ell  e  ■■ 

R_  =  (IJ  +  2)(Q  +  Q  +  Q  ).  Estimators  and  appear  in 

2  el-  e,l  e,2  2 

Klotz,  Milton  and  Zacks  (1969).  In  practice,  if  we  suspect  ’ 

small  we  would  use  '  ;  if  we  suspect  ..  small  we  would  use  ' 

e,2  e,2 

and  if  we  have  no  prior  suspicions  we  would  recommend 


(  ,  +  ■  ■  )  '2 
e,2  e,2 


(2. A) 


2  2-1 

Turning  to  •  we  may  write  '  =  J  (  ■  -  ■  )  whence  the  usual 

‘  1  e 

unbiased  estimator  is  given  by  J  [(I  -  1)  ^0  =  (1(J  -  1))  ]. 

1  e 


By  Lemma  2.1  this  is  immediately  dominated  by  J  [(I  +  1)  Q 
-1  1 
(1(J  -  1))  Q  ]  which  in  turn  is  dominated  by  '  = 

J  ^min{(I  +  d'^Q  (I  +  2)"\q  +  Q  )}  -  (I(J  -  1))"^Q  ]. 

1  1  •  e  ^ 

Using  Lemriia  2.2  we  arrive  at  the  positive  part  version  ‘  , 
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Alternatively,  again  by  Lemma  2.1,  the  usual  unbiased 

-1  -1  -1 
estimator  is  dominated  by  J  ((1-1)  Q  -  (I(J  -  1)  +  2)  Q  ] 

1  e 


which  is  dominated  by  J  [(I  +  1)  Q  -  (I(J  -  1)  +  2)  Q  ]  which 

1  e 

is  dominated  by  "  ^  2  ”  '^[min{(I  +  1)  (I  +  2)  +  Q,  )} 

-  (I(J  -  1)  +  2)  Q  ].  Again  by  Lemma  2.2,  we  arrive  at  ' 

+  e  1,2 

The  estimator  r  appears  in  Klotz,  Milton  and  Zacks.  Ncta  that 

'--1 , 1 

while  i  <<  t  since  ^  ‘  we  cannot  conclude  regarding 

a,l  1,2  .1,2  1,1 

+  4- 

and  ■  .  In  fact,  the  simulation  results  in  Section  4  show 

■i  I  1  ^  ^ 

that  neither  dominates  the  other. 

In  concluding  this  section  we  remark  that  utilizing  the  ideas 

in  Gelfand  and  Dey  (1987a)  and  in  Gelfano  (1987),  along  with  the 

a f orment ioned  results,  we  can  Improve  in  the  estimation  of  the 

ratio  ■  /•  .  This  allows  for  Improved  estimation  of,  e.g., 

k  k 


/  .  See  Loh  (1986)  in  this  regard.  We  omit  the  details, 

r  e 

Unfortunately,  we  cannot  extend  this  to,  e.g.,  the  Intraclass 
correlation  coefficient  since  it  is  a  non-linear  function  of  such 
ratios. 


3.  IMPROVED  ESTIMATES  USING  A  GEOMETRIC  MEANS  APPROACH 

In  this  section  we  develop  a  method  for  obtaining  Improved 

estimates  which  arises  from  expression  (1.2),  the  fact  that  the 

variance  components  are  expressible  as  linear  combinations  of  chi- 

square  scale  parameters.  Consider  a  single  which  we  write  as 

r 

be  a  candidate  estimator.  Here 

7 ’  Q  be  dominated  and  what  is  the 
k^k 


2 

'  r 


2  c  Y  and  let  2  f  n 

,  r.  It  k 

k=0  k=0 


we  denote  Q  by  Q  .  When  can 
e  0 


10 


form  of  the  dominating  estimator?  Dey  and  Gelfand  (1987)  discuss 

this  problem  when  the  >  are  scale  parameters  from  arbitrary 

k 

distributions.  Klonecki  and  Zontek  (1985,  1987),  assuming  the  i 

k 

are  scale  parameters  from  Gamma  distributions,  obtain  conditions 

which  enable  assessment  of  linear  admissibility  for  El  Q  ,  l.e., 

k  k 

2 

admissibility  within  the  class  of  linear  estimators  of  ~  .  They 
also  offer  a  slightly  broader  class  of  dominating  estimators  than 
in  Dey  and  Gelfand 

More  precisely,  the  following  result  appears  in  Dey  and 
Gelfand  i  “  i  . 

Theorem  3.1.  Let  Y  f.i  =  1.2 . t.t>2. 

-  i  i  1’  •  >  _  > 


Independent  and  such  that  K.Y  <  .  Consider  the  estimator 


Y  •*-  b  Y 
,  t  i  i 

i  = ;  1  =  1 


(3.1) 


Let  d  =  c  -  a  where  a  =  E(Y 
i  i  i  t  i  1 


.  ;  /K(Y, 

1  1 


=  1  )  and 


d  =  aln  d  ,  d  =  mar.  d  .  Then  (3.1)  dominates  Y 
(  1  )  i  ( t )  i  11 

1  1 

under  SEL  In  estimating  c  •  if  either 

1  1 


(i)  d  >  0  and  0  <  b  <  2td 

(1)  (1) 


(11)  d  <  0  and  2td  ,  <  b  <  0 

(t) 

where  .  =  E(Y  »  1)/E(Y  =1) 

i  1  i  1 

1  =  1 

If  we  denote  by  G(  ^)  the  gamma  density  f^(y)  = 


n 


^,-1  -y/t; 

e 


^  ,  then  a  *  a  +  t  and  0=  .i  +  t  ^ )/  ( j  +  2t  ). 

1,11  ^  1 

”1 

Now  let  D  be  a  diagonal  matrix  whose  diagonal  entries  are  the 

and  let  G  be  of  the  form  1'  where  2  is  a  nonnegative 
i  d 

definite  matrix  such  that  (“)  >  0,  (I)  >  0  and  £  is  a 

Ij  ~  il  d 

diagonal  matrix  whose  diagonal  entries  are  Then  Klonecki 

and  Zontek  (1985)  show; 

Theorem  3.2.  If  1  =  1  ...  t  independent,  then 

r  V  is  linearlv  admissible  for  c  If  and  only  if  there  exists 
i'  1  '  11 

a  matrix  G  of  the  above  form  such  that  (I  +  GD)  =  Gc  where 

<;  ...,  ^),  c  =  (c^,  ....  c^). 

Theorem  3.1  Is  often  too  restrictive.  If  instead  we  allow  a 


more  general  product  Y  we  can  choose  q  to  achieve  suitably 

i  j 

defined  "d^"  all  having  the  same  sign.  In  fact,  for  a  specified 
set  of  q  >0  such  that  q  -  1  Klonecki  and  Zontek  (1987),  again 

j  ~  j 

for  gamma  distributions,  provide  necessary  and  sufficient 

conditions  for  the  existence  of  an  estimator  of  this  form  which 

Improves  upon  ■  ^Y^.  We  state  a  version  of  tlieir  Lemma  1  which  is 

in  a  form  parallel  to  Theorem  3.1. 

Theorem  3.3.  if  Y  G(  <  ,  '‘  )  1  =  1  . .  .  t  independent  there 

-  ill 

exists  b  0  such  that  the  estimator 


■  Y  +  biY 
i  i  i  j 


(3.2) 


where  q  >0,  iq  =  1,  dominates  y  under  SEL,  in  estimating 

j  “  j  11 


-c^!^  if  and  only  If  either  (1)  or  (il)  below  holds.  Define  d^*  = 


12 


c  - 

(a  + 

q 

)'  .  d  * 

=  min 

d 

* 

and 

i 

1 

1  i’  (1) 

1 

i 

(1) 

d  * 

>_ 

o 

H 

O 

o 

if  d* 

as 

0 

and 

(1) 

1 

i 

(ii) 

d  * 

< 

0,  q  =  0 

if  d* 

0 

and 

(t) 

1 

1 

a 

+  q . 

1 

1  : 

where 

b*  = 

2 

n  (-i 

_ 

t  ej  _ 

) 

1 

1-" 

{j:d.^iO}  j 
J 


(In  the  gamma  case  w^(q)  =  +  q). 


Remark  3.1.  Theorem  3.3  holds  more  generally  than  for  the 

gamma  family.  Its  proof  only  requires  specification  of  the 

1+q  I  -  ,  q 

in'-reaslng  functions  w  (q)  ^  E(Y  =  1)/E(Y-  =  1). 

i  i  i  ^  i 

Given  w  we  can  characterize 
i 

the  sets  of  q'.s  which  make  the  corresponding  d* ' s  all  have  the 
J  i 

same  sign,  thus  enabling  domination  by  (3.2). 

Remark  3.2.  The  bounding  of  the  risk  difference  in  Theorem 

3.1  is  not  as  sharp  as  is  possible  under  the  Gamma  assumption  in 

Theorem  3.3;  hence,  the  resulting  bounds  on  b  in  Theorem  3.3  when 

all  q^  are  equal  are  more  liueral  than  those  in  Theorem  3.1. 

2 

Returning  to  the  estimation  of  a  variance  component  = 


c  •  consider  the  estimator  ■  Q  where 
,  k  k  k  k 

k=0  k=0 


.(f  +  2-:  ) 

k  k  k 


-1 


0  < 


<  1.  The  terms  ■  Q  range  from  the 
k  ~  k  k 

unbiased  to  the  best  invariant  estimator  of  c  -  as  ranges  from 
2  k  k  k 

0  to  1 .  Since  r 

k  f 

k 


is  G(.  fj,,  2 


If  c  -  0  we  must  set  q 
k  k 


0. 


(3.3) 

2 


Thus,  if  r  does  not  appear  in 

k  2  r  ■ 

using  Theorem  3.3,  Q  does  not  help  in  estimating  '  . 

k  °  r 


This 


clearly  differs  from  the  approach  in  Section  2  where,  for  example. 
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in  estimating  o  all  the  Q  can  be  used  to  improve  upon  the  best 
e  k 

invariant  estimator.  Note  that  with  2  as  defined  above,  from 

k 

(3.3),  the  sign  of  d*  depends  only  upon  sgn(c  (e  -  q  ));  for 
k  k  k  k 

specified  e  and  q  the  magnitude  of  c  does  not  play  a  role  with 
k  k  k 

respect  to  whether  an  estimator  of  the  form  (3.2)  can  dominate. 

From  (3.3)  if  all  e  =  0  or  all  e  =  1  this  approach  will 
k  k 

provide  a  dominating  estimator  if  and  only  if  at  least  two  c 


differ  from  0  and  all  nonzero  c  have  the  same  sign.  For  a 

k 

variance  component  some  pair  of  c  will  have  opposite  signs. 

k 

Therefore,  a  dominating  estimator  will  not  be  obtained  if  for  any 
such  pair  both  t  's  are  0  or  both  c's  are  1.  If  we  can  choose  q 

k 

to  make  ^  0  then  the  dominating  estimator  in  (3.2)  will  be 

a  "shrinker."  Hence,  using  Theorem  2.3,  the  positive  part  of 


+ 

(3.2)  will  dominate  [  ■  Q  ]  . 

k  k 

k=0 

In  this  spirit  it  Is  natural  to  ask  whether  the  approach  of 
this  section  can  be  combined  with  that  of  the  previous  section. 

Can  we  improve  upon  the  estimators  developed  through  Lemma  2.1  and 
Theorem  2.1  using  a  more  general  version  of  Theorem  3.3  as 
suggested  in  Remark  3.1?  The  answer  appears  to  be  no  since  in  the 
notation  of  Theorem  2.1  is  not  a  scale  parame  ,er  for  the 
distribution  of  '  The  reader  might  suggest  that  could  be 
viewed  as  a  scale  parameter  for  the  distribution  of  f  under 

j 

suitable  conditioning.  Following  the  argument  leading  to  Theorem 
3.3,  while  b  must  be  chosen  unconditionally,  it  would  have  to 
provide  improvement  at  each  conditional  level.  We  can  readily 
show  that  even  in  the  simplest  case,  t  =  2,  no  b  unequal  to  0  can 
achieve  this. 

As  an  example,  we  turn  again  to  the  one-way  ANOVA  using  the 

2  -1 

notation  in  Section  2.  Recalling  a  =  J  (  y,  ~  y  )  we  consider 

1  e 

dominating  the  estimator 
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-  1  +  2e^)  -  (I(J  -  1)  +  2c^)  (3.4) 

Thus,  d*>Oasq  <t,d*>Oasq  As  noted  above, 

1<  l>le<  e<e 

this  approach  unfortunately  does  not  provide  a  dominating 

estimator  in  the  two  Important  cases  where  ^  =0  and  where 

1  e 

£  =  e  =1.  Instead,  we  take 

1  e 

(1)  ^  =  1,  “  =0  for  which  any  q  ,  q  ,  >  0,  q  +  q  =  1  work 

L  e  I  Z  i  ^ 


and  (3-2)  becomes 
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J  ^[(I  -  -  (I(J  -  1)  +  2)'^Q  ]  +  b  Q  ®  (3.6) 

1  e  1  e 

We  conclude  this  section  with  a  remark. 

Remark  3.3.  Results  applicable  to  the  simultaneous 

estimation  of  variance  components  under  unweighted  SEL  are  given 

In  Kloneckl  and  Zontek  (1987).  In  particular,  extensions  of 

Theorems  3.2  and  3.3  are  given  for  the  estimation  of  a  vector  C 
T 

using  L  Y.  The  special  case  C  =  I  (not  of  Interest  here)  has  been 
extensively  discussed.  See,  e.g.,  Berger  (1980),  Das  Gupta 
(1986),  Dey  and  Gelfand  (1987),  and  Das  Gupta,  Dey  and  Gelfand 
(1987). 


4.  SIMULATION  RESULTS 

In  the  one-way  ANOVA  we  studied  Improved  estimation  of  both 
2  2 

and  ■  by  undertaking  a  substantial  simulation  study  over 
e  "  2  2 

various  values  of  I,  J,  i.,  r  and  :  .  Each  case  received  10,000 

e 

replications.  Even  with  so  many  replications,  resimulation  of 

particular  cases  suggests  that  the  stated  percent  improvements 

(Pi's)  will  only  be  accurate  within  2%. 

2 

In  estimating  '  some  selected  cases  are  presented  in  Table 
e 

1.  In  this  table  PI  is  relative  to  the  best  invariant  estimator 

given  above  (2.2).  Not  surprisingly  (2.2)  outperforms  (2.3)  when 

2 

is  small,  and  vice  versa  when  c.  Is  small.  The  estimator  (2.4) 

2  2, 

seems  like  a  good  compromise.  For  fixed  (u,  c, ,  -  ),  PI  s 

a  e 

Increase  in  I,  decrease  In  J.  Although  the  Pi's  are  small  the 
fact  that  (2.2)-(2.4)  are  so  simple  to  calculate  encourages  their 
use . 

2 

In  the  estimation  of  o  ,  the  reference  estimator  is  the 

a 

positive  part  of  the  unbiased  estimator. 


+ 

/J 


(4.1) 
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TABLE  1 

PERCENT  IMPROVEMENTS  IN  ESTIMATING  a 


2 


e 


(u,  c^,  0^) 

(2.2) 

PI  for 

(2.3) 

(2.4) 

I  =  2,  J  =  5 

(0,  1,  10) 

2.60 

2.39 

2.65 

(1.  .1,  10) 

1.95 

2.27 

2.28 

(1,  1.  1) 

1.39 

0.78 

1.24 

I  =  5,  J  =  5 

(0.  1.  10) 

4.09 

2.98 

3.72 

(1,  .1,  10) 

3.64 

3.36 

3.95 

(1.  1.  U 

0.72 

0.15 

0.50 

I  =  10,  J  =  5 

(0.  1,  10) 

4.97 

3.86 

4.53 

(1,  .1,  10) 

6.09 

3.92 

5.69 

(1.  1.  1) 

0.15 

0.03 

0.10 

As  shown  in  Section  2,  (4.1)  is  dominated  by 


•I  +1  I(J  -  1)' 


/J 


(4.2) 


and  by 


I  +  1 


I(J  -  1)  +  2 


l/J 


(4.3) 


Neither  of  (4.2)  and  (4.3)  dominates  the  other.  However,  from 

Section  2,  6  ^dominates  (4.2),’'  dominates  (4.3).  For  b 
G , 1  a, 2 

sufficiently  small  (3.5)  dominates  (4.2)  Ignoring  the  positive 
parts.  With  positive  parts  applied  to  both  estimators  this  is  no 
longer  true.  Since  there  is  no  obvious  optimal  choice  we  took  b 
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at  the  middle  of  the  allowable  range. 

j  +  + 

In  Table  2  we  compare  ^  and  6  with  (4,1).  We  see 

0,1  0,2 

enormous  improvement  for  both,  that  the  Pi's  are  essentially 

indistinguishable  and  that  neither  of  the  6's  dominates  the  other. 

We  would  draw  the  same  conclusions  in  the  comparison  of  (4.2)  and 

2  2 

(4.3)  with  (4.1).  Of  course,  if  o  >  c  then  6  will  tend  to 

a  e  a,l 

be  nonnegative  whence  the  domination  result  in  Section  2  argues 

r  +  + 

for  °  .  Turning  to  a  comparison  of  <5  with  (4.3)  we  see 

0*2  a, 2 

that  if  u  is  small  the  gain  may  be  substantial.  A  comparison  of 
+ 

6  with  (4.2)  would  yield  essentially  the  same  magnitudes  of 

01,1 

improvement.  Again,  since  these  estimators  are  so  simple  to 

calculate,  their  use  is  encouraged.  Finally,  the  comparison  of 

2 

the  positive  part  of  (3.5)  with  (4.2)  is  discouraging  when  ^  is 
2 

smaller  than  '  .  Modest  improvement  will  usually  occur  when 
2  2  e 

>  -■  .  This  Is  reasonable  since  then  the  positive  part 
e 

modification  is  rarely  applied  and  the  dominance  result  comes  into 


2  + 

We  conclude  by  recommending  (2.4)  for  •  and:  for 

e  x,2 
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TABLE  2 

2 

PERCENT  IMPROVEMENTS  IN  ESTIMATING 


(u.  . 

2  2x 

°e) 

+ 

6  ,  vs 

a,l 

(4.1) 

PI' 

+ 

a, 2 

(4.1) 

's 

+ 

6  ,  vs 

a, 2 

(4.3) 

(3.5)  vs 

(4.2) 

I  =  2,  J  =  5 

10, 

1.  1) 

92.22 

91.96 

17.67 

-53.73 

(0, 

1,  .1) 

66.49 

66.69 

1.10 

3.19 

(0, 

1.  u 

70.61 

71.31 

2.10 

3.78 

(1, 

.1,  1) 

90.80 

90.32 

1.87 

-53.68 

(1. 

1, 

•  1) 

66.44 

66 .64 

.90 

3.29 

(1. 

1, 

1) 

71.51 

72.14 

1.74 

3.44 

I  =  5,  J  =  3 

(0, 

•  1, 

,  1) 

68.55 

67.72 

14.90 

-172.38 

(0, 

1.  .1) 

33.33 

33.61 

1.57 

6.46 

(0, 

1, 

1) 

35.24 

37.03 

0.03 

-174.08 

(1, 

.1) 

32.08 

32.33 

0.63 

6.56 

(1. 

i,  u 

33.42 

35.08 

0.71 

3.59 

I  =  10,  J  =  5 

(0, 

•  1 

,  1) 

44.49 

43.92 

8.73 

-336.81 

(0, 

1, 

.1) 

19.55 

19.73 

1.35 

3.77 

(0, 

1) 

19.39 

20.88 

1.33 

-27.04 

(1. 

.1,  1) 

40.47 

39.21 

0.00 

-338.97 

(1, 

1,  .1) 

18.81 

18.91 

0.10 

3.60 

(1. 

1.  1) 

18.39 

19.68 

0.20 

-25.85 
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